Relevance Score of Triplets Using Knowledge Graph Embedding - The Pigweed Triple Scorer at WSDM Cup 2017

نویسندگان

  • Vibhor Kanojia
  • Riku Togashi
  • Hideyuki Maeda
چکیده

Collaborative Knowledge Bases such as Freebase [1] and Wikidata [2] mention multiple professions and nationalities for a particular entity. The goal of the WSDM Cup 2017 [3] Triplet Scoring Challenge was to calculate relevance scores between an entity and its professions/nationalities. Such scores are a fundamental ingredient when ranking results in entity search. This paper proposes a novel approach to ensemble an advanced Knowledge Graph Embedding Model with a simple bag-of-words model. The former deals with hidden pragmatics and deep semantics whereas the latter handles text-based retrieval and low-level semantics. 1. TASK INTRODUCTION Many entities usually have multiple professions or nationalities, and it is often desirable to rank the relevance of these individual triplets. The goal of the challenge was to compute a score in the range [0, 7] that measures the relevance of the statement expressed by the individual triplet compared to other triplets from the same relation. Participants were provided with a list of 385,426 entities along with five files, • profession.kb: all professions for a set of 343,329 entities • nationality.kb: all nationalities for a set of 301,590 entities • profession.train: relevance scores for 515 tuples (pertaining to 134 entities) from profession.kb • nationality.kb: relevance scores for 162 tuples (pertaining to 77 entities) from nationality.kb • nationality.kb: 33,159,353 sentences from Wikipedia with annotations of the 385,426 entities Apart from these, the participants were allowed to use any kind or amount of additional data (except for human/judgements). The output of this task was to generate relevance scores for all the triplets, 0 being the lowest relevance, and 7 being the highest.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Predicting Relevance Scores for Triples from Type-Like Relations using Neural Embedding - The Cabbage Triple Scorer at WSDM Cup 2017

The WSDM Cup 2017 Triple scoring challenge is aimed at calculating and assigning relevance scores for triples from type-like relations. Such scores are a fundamental ingredient for ranking results in entity search. In this paper, we propose a method that uses neural embedding techniques to accurately calculate an entity score for a triple based on its nearest neighbor. We strive to develop a ne...

متن کامل

Integrating Knowledge from Latent and Explicit Features for Triple Scoring - Team Radicchio's Triple Scorer at WSDM Cup 2017

The objective of the triple scoring task in WSDM Cup 2017 is to compute relevance scores for knowledge-base triples of typelike relations. For example, consider Julius Caesar who has had various professions, including Politician and Author. For two given triples (Julius Caesar, profession, Politician) and (Julius Caesar, profession, Author), the former triple is likely to have a higher relevanc...

متن کامل

Triple Scoring Using a Hybrid Fact Validation Approach - The Catsear Triple Scorer at WSDM Cup 2017

With the continuous increase of data daily published in knowledge bases across the Web, one of the main issues is regarding information relevance. In most knowledge bases, a triple (i.e., a statement composed by subject, predicate, and object) can be only true or false. However, triples can be assigned a score to have information sorted by relevance. In this work, we describe the participation ...

متن کامل

Triple Scoring Using Paragraph Vector - The Gailan Triple Scorer at WSDM Cup 2017

In this paper we describe our solution to the WSDM Cup 2017 Triple Scoring task. Our approach generates a relevance score based on the textual description of the triple’s subject and value (Object). It measures how similar (related) the text description of the subject is to the text description of its values. The generated similarity score can then be used to rank the multiple values associated...

متن کامل

Predicting Triple Scoring with Crowdsourcing-specific Features - The fiddlehead Triple Scorer at WSDM Cup 2017

The Triple Scoring Task at the WSDM Cup 2017 involves the prediction of the relevance scores between persons and professions/nationalities. The ground truth of the relevance scores was obtained by counting the vote of seven crowdworkers. I confirmed that features related to task difficulty correlate with the discrepancy among crowdworkers’ judgement. This means such features are useful for pred...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1712.08353  شماره 

صفحات  -

تاریخ انتشار 2017